Abstract

Relay selection for cooperative communications promises significant performance improvements, 
and is, therefore, attracting considerable attention. While several criteria have been proposed for selecting 
one or more relays, distributed mechanisms that perform the selection have received relatively less 
attention. In this paper, we develop a novel, yet simple, asymptotic analysis of a splitting-based multiple 
access selection algorithm to find the single best relay. The analysis leads to simpler and alternate
expressions for the average number of slots required to find the best user. By introducing a new
'contention load' parameter, the analysis shows that the parameter settings used in the existing literature 
can be improved upon. New and simple bounds are also derived. Furthermore, we propose a new 
algorithm that addresses the general problem of selecting the best Q > 1 relays, and analyze and 
optimize it. Even for a large number of relays, the algorithm selects the best two relays within 4.406 
slots and the best three within 6.491 slots, on average. We also propose a new and simple scheme for 
the practically relevant case of discrete metrics. Altogether, our results develop a unifying perspective 
about the general problem of distributed selection in cooperative systems and several other multi-node 
systems. 

Index Terms 

Relays, cooperative communications, selection, multiple access, splitting. 



V. Shah and N. B. Mehta are with the Electrical Communication Engineering Dept. at the Indian Institute of Science (IISc), 
Bangalore, India. R. Yim is with the Mitsubishi Electric Research Labs (MERL), Cambridge, MA, USA. 
Emails: virag4u@gmail.com, nbmehta@ece.iisc.ernet.in, yim@merl.com.
A portion of this work has appeared in the IEEE International Conference on Communications (ICC) 2009. 



November 23, 2009 



DRAFT 




Splitting Algorithms for Fast Relay Selection:
Generalizations, Analysis, and a Unified View 

I. Introduction 

Selection mechanisms arise in many wireless communication schemes that use the most suitable
candidates from among a set of many candidates. A pertinent example is a cooperative communication
system that exploits spatial diversity by selecting the best relay(s) to forward a message
from a source to a destination. Selection makes cooperation practical because it mitigates the tight
synchronization that is required among many geographically distributed cooperating relays [1]-[11].
Another example is a cellular system that schedules in a proportional fair manner to the
best mobile station based on the average data rate and the current state of the channel between 
the base station and the mobiles [12]. QoS requirements can also be incorporated in the selection 
metric, as is done, for example, in a wireless local area network (WLAN). In sensor networks, 
node selection is known to improve network lifetime. 

Several relay selection criteria have been proposed and analyzed in the literature. For example, 
[1] showed that for a decode-and-forward cooperation scheme, best relay selection achieves full 
diversity. In [3], criteria for selecting multiple relays were proposed to minimize data transmission 
time. In [10] relay subset selection was considered for rate maximization. In [6], best two relay 
selection was used to improve the diversity-multiplexing tradeoff of an amplify and forward 
protocol. In [7], multiple relay selection was optimized for cooperative beamforming. Multiple 
relay selection for wireless network coding was considered in [13]. 

The design of the mechanism that physically selects - as per the selection or suitability criteria 
- the best relay or, in general, the Q best relays is, therefore, an important problem. Depending 
on the transmission scheme, the suitability metric can be a function of both the source-relay 
and relay-destination channel gains or just the relay-destination or source-relay channel gains. 
It is desirable that the mechanism be distributed since, typically, the knowledge of the metric 
is initially available only locally at the relay. For example, a centralized polling mechanism for 
selection is undesirable as the time to select increases linearly with the number of available 
relays. To this end, a decentralized back-off timer-based scheme for single best relay selection
was proposed in [1]. In it, each node transmits a short message when its timer expires. Making
the timer value inversely proportional to the metric ensures that the first node that the sink hears 
from is the best node. A distributed single relay selection algorithm was also proposed in [14] 
to minimize the bit error rate. In [15], the source uses handshake messages from relays to track 
the rate that each candidate relay can support. 

An alternate approach considers a time-slotted multiple access contention based algorithm in 
which each active node locally decides whether or not to transmit in a certain time slot. Recently, 
variations based on splitting algorithms, which were extensively researched two decades ago for 
multiple access control [16, Chp. 4], have been proposed for single relay selection [17], [18]. In 
each step of the splitting-based selection algorithm proposed in [17], only those nodes whose 
metrics lie between two thresholds transmit. The nodes update the thresholds (independently) in 
each slot based on the outcome of the previous slot fed back by the sink.^1 It was shown in [17]
for continuous metrics that the best node can be found, on average, within at most 2.507 slots 
even for an infinite number of nodes. This result was obtained by deriving an upper bound on 
the average number of slots when the number of relays tends to infinity. However, the analysis 
was quite involved and the upper bound was in the form of an infinite series. 

While distributed selection mechanisms have been proposed for single relay selection, several
questions remain open. For example, developing a comprehensive analysis of the splitting mechanism
is an important problem. A natural question that such an analysis will answer is how to optimally
choose the thresholds to improve the speed of selection. In [17], the thresholds are initially set
greedily so as to maximize the probability of success. As we show, this is not optimal. Furthermore,
efficient mechanisms are yet to be developed for multiple relay selection. The only option known 
currently is to run the single relay selection algorithm multiple times, which, as we show in 
this paper, is inefficient. Finally, the mechanisms above assume that the selection metric is 
continuous, and exploit the fact that, with probability 1, no two relays have the same metric. The 
mechanism catastrophically breaks down when the metrics are discrete, which can often occur 
in practice. This occurs, for example, when the estimation inaccuracy renders higher resolution 
representations unnecessary, or when quantized metrics for feedback or QoS are considered [11],
[19].

^1 We use the generic term 'sink' to refer to the source or access point or base station, as the case may be, that needs to select
the best node/relay.

This paper thoroughly examines splitting-based selection algorithms for both continuous and 
discrete metrics, and makes the following significant contributions: 

• Analysis of single relay selection: The paper develops a novel and considerably simpler 
exact asymptotic analysis for a general version of the splitting algorithm. It achieves this 
by developing a different Poisson process interpretation of the metric distribution, which 
has not been used before to the best of our knowledge. Furthermore, it also derives a new 
convex and simple upper bound for the average number of slots required to select the best 
relay. 

• Optimization of single relay selection: The paper analytically determines the optimal
performance of the splitting algorithm. It also rigorously shows that the greedy parameter choice
of [17] is sub-optimal, but is still very good.

• An alternate Markovian analysis: The Poisson process interpretation also leads to an
alternate and novel Markovian analysis, which among other things yields a new exact asymptotic
expression for the average number of slots. As we shall see, while the two new expressions 
derived in this paper are equivalent, they exhibit different behaviors when truncated. 

• New mechanism for multiple relay selection, including its analysis and optimization: The 
paper proposes a novel scalable, fast, and decentralized algorithm for the general problem 
of selecting not just the single best but the best Q > 1 relays. To the best of our knowledge, 
this is the fastest family of Q relay selection algorithms proposed to date. We develop an 
asymptotic analysis of the general Q relay selection algorithm, and determine its optimal 
parameters. We show that as Q increases, the greedy parameter choice becomes more 
suboptimal. In effect, as Q increases, the optimal splitting algorithm prefers that more 
nodes collide since it is faster to resolve a collision than avoid one. 

• Unifying perspective: The paper shows that the optimized best relay selection algorithm, 
the proposed multiple relay selection mechanism, and Gallager's First Come First Serve 
(FCFS) multiple access control algorithm [16] are intimately related. 

• New scalable algorithm for discrete metrics: Finally, the paper proposes a novel, scalable,
and intuitive distributed scheme called Proportional Expansion, which enables the single
and multiple relay selection algorithms to be applied to the practical case of discrete metrics.

The rest of the paper is organized as follows. The analysis and results for single best node
selection are developed in Sec. II. The new algorithm for Q > 1 node selection is proposed,
analyzed, and simulated in Sec. III. We conclude in Sec. IV. Several mathematical proofs are
relegated to the Appendix.

II. Single Relay Selection

A. System Setup 

Consider a time-slotted system with $n$ active nodes and a sink, as shown in Fig. 1. Each node $i$
has a suitability metric $u_i$, which is known only to that specific node. In this section, the goal is to
select the node with the highest metric. The metrics are continuous and i.i.d. with complementary
CDF (CCDF) denoted by $F_c(u) = \Pr(u_i > u)$. Therefore, $F_c(\cdot)$ is monotonically decreasing
and invertible. (The discrete metric case, where this is not so, is tackled in Sec. III-F.)

B. Splitting Algorithm: Brief Review and Notation 

We now formally define the splitting algorithm for single relay selection. To keep the treatment 
concise, we first define the state variables maintained by the algorithm and their initialization. 
Thereafter, we describe how the algorithm controls the transmissions of the nodes, how the sink 
generates feedback based on these transmissions, and how the state variables get autonomously 
updated based on the feedback. 

Definitions: The generalized best relay selection algorithm is specified using three variables
$H_L(k)$, $H_H(k)$, and $H_{\min}(k)$; the notation is consistent with that in [17]. $H_L(k)$ and $H_H(k)$
are the lower and upper metric thresholds such that a node $i$ transmits at time slot $k$ only if its
metric $u_i$ satisfies $H_L(k) < u_i < H_H(k)$. $H_{\min}(k)$ tracks the largest value of the metric known
up to slot $k$ above which the best metric surely lies.

Initialization: In the first slot ($k = 1$), the parameters are initialized as follows: $H_L(1) =
F_c^{-1}(p_e/n)$, $H_H(1) = \infty$, and $H_{\min}(1) = 0$. Here, $p_e$ is a system parameter, and shall henceforth
be referred to as the contention load parameter.

Transmission rule: At the beginning of each slot, each node locally decides whether to transmit. As
mentioned, it transmits if and only if its metric lies between $H_L(k)$ and $H_H(k)$.

Feedback generation: At the end of each slot, the sink broadcasts to all nodes a two-bit
feedback: (i) 0 if the slot was idle (when no node transmitted), (ii) 1 if the outcome was a
success (when exactly one node transmitted), or (iii) e if the outcome was a collision (when
multiple nodes transmitted).^2

Response to feedback: We first define the split function^3 to facilitate description: Let
$\mathrm{split}(a, b) = F_c^{-1}\left(\frac{F_c(a) + F_c(b)}{2}\right)$. Then, depending on the feedback, the following possibilities occur:

1) If the feedback (of the $k$-th slot) is an idle (0) and no collision has occurred so far, then
set $H_H(k+1) = H_L(k)$, $H_L(k+1) = F_c^{-1}\left(\frac{(k+1) p_e}{n}\right)$, and $H_{\min}(k+1) = 0$.

2) If the feedback is a collision (e), then set $H_L(k+1) = \mathrm{split}(H_L(k), H_H(k))$, $H_H(k+1) =
H_H(k)$, and $H_{\min}(k+1) = H_L(k)$.

3) If the feedback is an idle (0) and a collision has occurred in the past, then set $H_H(k+1) =
H_L(k)$, $H_L(k+1) = \mathrm{split}(H_{\min}(k), H_L(k))$, and $H_{\min}(k+1) = H_{\min}(k)$.

Termination: The algorithm terminates when the outcome is a success (1). 

We shall call the durations before and after the first non-idle slot the idle and collision
phases, respectively. Thus, the contention load parameter, $p_e$, is the average number of users
that transmit in a slot in the idle phase. The Qin-Berry algorithm [17] uses $p_e = 1$, which is the
value that maximizes the probability of a success outcome in an idle phase slot.
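The threshold updates described in this subsection can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: it works with the normalized metric $x_i = F_c(u_i)$ (uniform on (0, 1], smaller is better), so that the split function becomes a plain midpoint, and it assumes the idle-phase window in slot $k$ is $((k-1)p_e/n, k p_e/n]$. The names `select_best` and `average_slots` are hypothetical.

```python
import random


def select_best(x, p_e):
    """Run the splitting algorithm on normalized metrics.

    x[i] = F_c(u_i) lies in (0, 1]; a smaller x[i] means a larger metric u_i.
    Returns (index of the selected node, number of slots used).
    """
    n = len(x)
    lo = 0.0                  # F_c(H_H(k)): better edge of the window
    hi = min(p_e / n, 1.0)    # F_c(H_L(k)): worse edge of the window
    bound = 1.0               # F_c(H_min(k))
    collided = False
    k = 0
    while True:
        k += 1
        transmitters = [i for i, xi in enumerate(x) if lo < xi <= hi]
        if len(transmitters) == 1:       # success: terminate
            return transmitters[0], k
        if len(transmitters) >= 2:       # collision: keep the better half
            collided = True
            bound = hi
            hi = (lo + hi) / 2.0
        elif not collided:               # idle during the idle phase
            lo, hi = hi, min((k + 1) * p_e / n, 1.0)
        else:                            # idle during the collision phase
            lo, hi = hi, (hi + bound) / 2.0


def average_slots(n, p_e, trials, seed=0):
    """Monte Carlo estimate of the average number of slots (assumed helper)."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        x = [1.0 - rng.random() for _ in range(n)]   # uniform on (0, 1]
        total += select_best(x, p_e)[1]
    return total / trials
```

Calling `average_slots(50, 1.088, 20000)` gives an estimate that can be compared against the asymptotic values reported in Sec. II-D.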

C. Main Analytical Results 

The floor and ceil operations are denoted by $\lfloor \cdot \rfloor$ and $\lceil \cdot \rceil$, respectively. $E[Z]$ will denote the
expected value of a random variable $Z$.

We now develop a new analysis of the average time taken, $m_n(p_e)$, by the splitting algorithm
to select the single best relay. The following lemma gives an exact expression for $m_n(p_e)$.

Lemma 1: Let $X_k$ be the number of slots required to resolve a collision among $k$ nodes.
Let $q = \lceil n/p_e \rceil - 1$ denote the idle phase duration in slots. The average number of slots, $m_n(p_e)$,
required to find the best node is given by

$$m_n(p_e) = \sum_{i=1}^{q} \sum_{k=1}^{n} \binom{n}{k} \left(\frac{p_e}{n}\right)^{k} \left(1 - \frac{i p_e}{n}\right)^{n-k} \left(E[X_k] + i\right) + \left(1 - \frac{q p_e}{n}\right)^{n} \left(E[X_n] + q + 1\right), \qquad (1)$$

where $E[X_k]$ follows the recursion $E[X_k] = \frac{2^{k} + \sum_{i=2}^{k-1} \binom{k}{i} E[X_i]}{2^{k} - 2}$ for all $k \ge 2$, and $E[X_1] = 0$.

Proof: The proof is given in Appendix A. ■

The above expression is complex and does not directly reveal the scalable nature of the
algorithm. The theorem below provides two equivalent and new expressions for the asymptotic
case ($n \to \infty$).

^2 The sink can distinguish between these outcomes using, for example, the strength of the total received power [20].

^3 The split function makes sure that on an average half of the nodes involved in the last collision transmit in the next slot.
Splitting can be made faster as was done in [21]. However, doing so requires each node to numerically calculate thresholds in
each slot that are solutions of degree $n - 1$ equations. Also, the improvement due to this scheme turns out to be less than 0.5%.
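To make Lemma 1 concrete, the following Python sketch evaluates (1) numerically. It is a sketch under stated assumptions rather than the authors' code: the recursion for $E[X_k]$ is taken to be $E[X_k] = (2^k + \sum_{i=2}^{k-1} \binom{k}{i} E[X_i]) / (2^k - 2)$, which is what the halving split rule implies when each colliding node independently falls in the better half with probability 1/2, and $q$ is taken as $\lceil n/p_e \rceil - 1$. The function names are hypothetical.

```python
from math import ceil, comb


def expected_resolution_slots(k_max):
    """Return a list ex with ex[k] = E[X_k] for k = 0..k_max (ex[0] = ex[1] = 0)."""
    ex = [0.0] * (k_max + 1)
    for k in range(2, k_max + 1):
        inner = sum(comb(k, i) * ex[i] for i in range(2, k))
        ex[k] = (2 ** k + inner) / (2 ** k - 2)
    return ex


def m_finite(n, p_e):
    """Average number of slots for n nodes, following the structure of (1)."""
    q = ceil(n / p_e) - 1                 # assumed idle phase duration
    ex = expected_resolution_slots(n)
    total = 0.0
    for i in range(1, q + 1):             # first non-idle slot is slot i
        for k in range(1, n + 1):         # k nodes transmit in that slot
            prob = (comb(n, k) * (p_e / n) ** k
                    * (1 - i * p_e / n) ** (n - k))
            total += prob * (ex[k] + i)
    # all n nodes fall in the final (possibly shorter) window
    total += (1 - q * p_e / n) ** n * (ex[n] + q + 1)
    return total
```

For moderate $n$ (for example `m_finite(50, 1.088)`), the result can be compared with the asymptotic value discussed in Sec. II-D. The sketch is intended for small to moderate $n$ only, since it multiplies large binomial coefficients by small floating-point powers.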

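Theorem 1: The average number of slots required to find the best node as $n \to \infty$ is given
by the following two different yet equivalent expressions.

1) Recursive expression:

$$m_\infty(p_e) = \frac{1}{1 - e^{-p_e}} + \frac{1}{e^{p_e} - 1} \sum_{k=1}^{\infty} \frac{p_e^{k}}{k!} E[X_k]. \qquad (2)$$

2) Non-recursive expression:

$$m_\infty(p_e) = \frac{1}{1 - e^{-p_e}} + \sum_{i=1}^{\infty} p(i), \qquad (3)$$

where $p(i) = (1 - p_0) \prod_{j=1}^{i-1} (1 - p_j)$, $p_0 = \frac{p_e e^{-p_e}}{1 - e^{-p_e}}$, and $p_j = \frac{2^{-j} p_e\, e^{-2^{-j} p_e} \left(1 - e^{-2^{-j} p_e}\right)}{1 - \left(1 + 2^{-(j-1)} p_e\right) e^{-2^{-(j-1)} p_e}}$, $\forall\, j \ge 1$.

Proof: We show the proof for the recursive expression in (2) below as it leads to a powerful
new Poisson point process interpretation that will be useful throughout this paper. For example,
it will lead to the derivation of the non-recursive expression in (3), whose proof is relegated to
Appendix B. The physical meaning of $p(i)$ and $p_j$ will become clear after the proof.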

Let node $i$ have metric $u_i$ with CCDF $F_c(u)$. Let $y_i = n F_c(u_i)$. Then, the $y_i$ are i.i.d. and are
uniformly distributed in $[0, n]$. Note that selecting the node with the highest $u_i$ is equivalent to
selecting the node with the lowest $y_i$ because the CCDF is a monotonically decreasing function.
Sorting in ascending order, we get $y_{[1]} < y_{[2]} < y_{[3]} < \cdots < y_{[n]}$, where $[i]$ is the index of
the relay with the $i$-th largest metric.

Given $y_i$, we can define a point process [22] $M(t)$ as $M(t) = \max\{k \ge 1 : y_{[k]} \le t\}$. Thus,
$M(t)$ is the number of points that have occurred up to time $t$. Since the $y_i$ are i.i.d. and
uniformly distributed, $M(t)$ is binomially distributed. As $n \to \infty$, it can be shown that $M(t)$
forms a Poisson process with rate 1 [22]. Now, the probability that the first non-idle slot is the
$i$-th slot and $k \ge 1$ nodes are involved is equal to the probability that $y_{[1]}, \ldots, y_{[k]}$ lie between
$(i-1)p_e$ and $i p_e$, and $y_{[j]} > i p_e$ for $k + 1 \le j \le n$. It also implies that no points lie between
0 and $(i-1)p_e$. Therefore,

$$\begin{aligned}
\Pr\left( \frac{y_{[1]}}{n} > \frac{(i-1)p_e}{n},\ \frac{(i-1)p_e}{n} < \frac{y_{[k]}}{n} < \frac{i p_e}{n},\ \frac{y_{[k+1]}}{n} > \frac{i p_e}{n} \right)
&= \Pr\left( M((i-1)p_e) = 0,\ M(i p_e) = k \right), \\
&= \Pr\left( M((i-1)p_e) = 0 \right) \Pr\left( M(i p_e) = k \mid M((i-1)p_e) = 0 \right), \\
&\overset{(a)}{\underset{n \to \infty}{=}} e^{-(i-1)p_e}\, \frac{p_e^{k}}{k!} e^{-p_e} = \frac{p_e^{k}}{k!} e^{-i p_e}. \qquad (4)
\end{aligned}$$

Here, (a) follows from the memoryless property of the Poisson process [22]. Recall that $E[X_k]$
is the expected number of slots required to resolve a collision among $k$ nodes. Thus, if the
first non-idle slot is the $i$-th slot and $k \ge 1$ nodes are involved, then $E[X_k] + i$ slots are
required to find the best node. Also, as $n \to \infty$, $q p_e / n \to 1$. Hence, we get $m_\infty(p_e) =
\sum_{i=1}^{\infty} \sum_{k=1}^{\infty} \frac{p_e^{k}}{k!} e^{-i p_e} \left(E[X_k] + i\right)$. The desired result follows with the help of combinatorial
identities [23]. ■
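The equivalence of the two expressions in Theorem 1 can be checked numerically. The Python sketch below is illustrative only and rests on assumptions: it takes the recursive form as $m_\infty(p_e) = \frac{1}{1 - e^{-p_e}} + \frac{1}{e^{p_e} - 1} \sum_k \frac{p_e^k}{k!} E[X_k]$, obtained by summing the double series at the end of the proof over $i$, and it takes $p_0$ and $p_j$ to be the conditional success probabilities implied by the Poisson interpretation (exactly one point in the examined half, given at least two points in the current interval). Both infinite series are truncated; the truncation lengths and function names are arbitrary choices made here.

```python
from math import comb, exp, expm1, factorial


def expected_resolution_slots(k_max):
    """ex[k] = E[X_k], assuming the halving-split recursion with E[X_1] = 0."""
    ex = [0.0] * (k_max + 1)
    for k in range(2, k_max + 1):
        inner = sum(comb(k, i) * ex[i] for i in range(2, k))
        ex[k] = (2 ** k + inner) / (2 ** k - 2)
    return ex


def m_inf_recursive(p_e, k_max=60):
    """Truncated recursive expression: idle phase plus weighted E[X_k]."""
    ex = expected_resolution_slots(k_max)
    series = sum(p_e ** k / factorial(k) * ex[k] for k in range(1, k_max + 1))
    return 1.0 / (1.0 - exp(-p_e)) + series / (exp(p_e) - 1.0)


def m_inf_nonrecursive(p_e, terms=40):
    """Truncated non-recursive expression: idle phase plus sum of p(i)."""
    p0 = p_e * exp(-p_e) / (1.0 - exp(-p_e))
    prob = 1.0 - p0           # p(1): collision phase lasts at least one slot
    total = 0.0
    for j in range(1, terms + 1):
        total += prob
        a = p_e * 2.0 ** (-j)                     # length of the examined half
        num = a * exp(-a) * (-expm1(-a))          # one point here, >= 1 beyond
        den = -expm1(-2 * a) - 2 * a * exp(-2 * a)  # >= 2 points in the interval
        prob *= 1.0 - num / den                   # p(j + 1) = p(j) (1 - p_j)
    return 1.0 / (1.0 - exp(-p_e)) + total
```

The `expm1` calls avoid the cancellation that a literal evaluation of $1 - (1 + 2a)e^{-2a}$ would suffer for very small $a$. Evaluating both functions at the same $p_e$ should give matching values up to truncation and rounding error.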

The main theorem readily gives rise to the following upper bound expression that does not
involve an infinite series.

Corollary 1: For any real $k_0 > e/2$,

$$m_\infty(p_e) < \frac{[\ldots]}{k_0 \log_e(2)} + \log_2\left(\frac{[\ldots]}{e}\right) + \frac{[\ldots]}{1 - e^{-p_e}}. \qquad (5)$$

Proof: The proof is given in Appendix C. ■

Alternatively, since both the expressions derived in Theorem 1 involve only positive terms
in the series summation, considering only the first few terms of the infinite series in (2) and
(3) results in tight lower bounds near the optimal contention parameter value. These simplified
expressions allow system designers to quickly compute the necessary parameters for system
optimization. As we shall see, their behavior turns out to be quite different and sheds light on
the differences between the two equivalent expressions derived in Theorem 1.

D. Results for Single Relay Selection 

Figure 2 plots the average number of slots required to select the best node as a function of
$p_e$ for the two expressions and verifies them using Monte Carlo simulations. It can be seen
that the asymptotic expression is accurate even when the number of relays is small (e.g., 10).
Furthermore, the optimal value of $m_\infty(p_e)$ is 2.467, and occurs at $p_e = 1.088$. As expected,
the optimal $p_e$ does not exceed 2. This is because having more than two nodes on average
transmit and collide in a slot is suboptimal. We also observe that $m_\infty(p_e)$ at $p_e = 1$ is quite
close to the optimal value.

Figure 3 plots the upper bound using $k_0 = 2$. As expected, it has a unique minimum and
follows the behavior of the exact expression well in the region of interest of $p_e$. The same
figure also compares the lower bounds obtained using the first 4 terms of both the expressions
of Theorem 1. For higher values of $p_e$, the lower bound obtained by truncating the recursive
expression in (2) does not capture the behavior of the exact expression well. This is because of
the truncation, on account of which the possibility that a large number of nodes collide in the
first non-idle slot is not accounted for. This probability is not negligible for larger $p_e$. However,
the lower bound obtained by truncating the non-recursive expression in (3) does better at larger
$p_e$ because the summation in the series is over the number of slots required after the first non-idle
slot and not over the number of nodes that collided in the first non-idle slot.

III. Q-Relay Selection Algorithm

We now develop a new family of splitting algorithms for selecting the relays with the Q best 
(highest) metrics, where Q is a pre-specified system parameter. The value of Q depends on the 
system under consideration. For example, in [13], M - 1 relays need to be selected to forward
the transmissions by M sources. The choice of Q, which is beyond the scope of this paper, 
is ultimately governed by the end-to-end system performance and practical constraints such as 
the synchronization requirements across the selected relays. For example, while having more 
cooperative relays improves the reliability or speed of transmission of data to the destination, 
selecting them will also require the system to expend additional resources. The reader is referred 
to [3], [7], [10], [24] for a detailed discussion on this aspect. 

A. Algorithm Motivation and Definition 

When we revisit the asymptotic regime considered in the previous section, we observe the 
following. The single node selection algorithm, in effect, runs the FCFS algorithm [16] on the 
Poisson point process $M(t)$ defined in Theorem 1, with $t$ being interpreted as time. However,
unlike FCFS, the single relay selection algorithm stops as soon as it finds the first (best) node.
In this context, the parameter $p_e$ is analogous to FCFS's initial contention interval.

Based on the above insight, we now formally state the new multi-relay selection algorithm
given any $Q$. We then explain the logic behind it and fully analyze it. For this, we adopt the
notation used for FCFS in [16], as it turns out to be more convenient.

As in Sec. II, let $y_i = n F_c(u_i)$. The algorithm specifies four state variables $S(k)$, $T(k)$, $\alpha(k)$,
and $\sigma(k)$ for each slot $k$. $S(k)$ is the number of nodes selected before slot $k$. $(T(k), T(k) + \alpha(k))$
represents the threshold interval for slot $k$, i.e., all the nodes with $y_i \in (T(k), T(k) + \alpha(k))$
transmit in slot $k$. (Equivalently, $H_H(k) = F_c^{-1}(T(k)/n)$ and $H_L(k) = F_c^{-1}((T(k) + \alpha(k))/n)$.)
$\sigma(k) \in \{L, R\}$ indicates whether the $k$-th slot interval is the left half or the right half of the
previously split interval. During initial slots, when no collision is to be resolved, $\sigma(k) = R$ by
convention. Thus, for $k = 1$, we have $S(1) = 0$, $T(1) = 0$, $\alpha(1) = p_e$, and $\sigma(1) = R$.

In the $(k+1)$-th slot ($k \ge 1$):

1) If feedback is a collision (e), then $T(k+1) = T(k)$, $\alpha(k+1) = \alpha(k)/2$, and $\sigma(k+1) = L$.

2) If feedback is a success (1) and $\sigma(k) = L$, then $T(k+1) = T(k) + \alpha(k)$, $\alpha(k+1) = \alpha(k)$,
and $\sigma(k+1) = R$.

3) If feedback is an idle (0) and $\sigma(k) = L$, then $T(k+1) = T(k) + \alpha(k)$, $\alpha(k+1) = \alpha(k)/2$,
and $\sigma(k+1) = L$.

4) If feedback is an idle (0) or a success (1), and $\sigma(k) = R$, then $T(k+1) = T(k) + \alpha(k)$,
$\alpha(k+1) = p_e$, and $\sigma(k+1) = R$.

5) Increment $S(k+1)$ by 1 if feedback is a success (1). Terminate if $S(k+1)$ reaches $Q$.
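The five update rules above translate almost line for line into code. The following Python sketch simulates them in the asymptotic regime, where the normalized metrics $y_i$ are assumed to form a rate-1 Poisson point process; the points are generated lazily from exponential gaps. It is an illustration under that assumption, and the names `select_q_best` and `mean_slots` are hypothetical.

```python
import random
from bisect import bisect_right


def select_q_best(Q, p_e, rng):
    """Simulate the Q-relay selection rules on a rate-1 Poisson process.

    Returns (slots used, selected y values, all generated points in order).
    """
    points = []                               # ascending y values

    def extend_to(limit):
        # generate points until the last one lies beyond `limit`
        last = points[-1] if points else 0.0
        while last <= limit:
            last += rng.expovariate(1.0)
            points.append(last)

    T, alpha, sigma = 0.0, p_e, "R"           # T(1), alpha(1), sigma(1)
    chosen, slots = [], 0
    while len(chosen) < Q:
        slots += 1
        extend_to(T + alpha)
        first = bisect_right(points, T)
        count = bisect_right(points, T + alpha) - first
        if count >= 2:                        # rule 1: collision
            alpha, sigma = alpha / 2.0, "L"
        elif count == 1 and sigma == "L":     # rule 2: success in a left half
            chosen.append(points[first])
            T, sigma = T + alpha, "R"
        elif count == 0 and sigma == "L":     # rule 3: idle in a left half
            T, alpha = T + alpha, alpha / 2.0
        else:                                 # rule 4: idle or success, sigma = R
            if count == 1:
                chosen.append(points[first])
            T, alpha, sigma = T + alpha, p_e, "R"
    return slots, chosen, points


def mean_slots(Q, p_e, trials, seed=0):
    """Monte Carlo estimate of the average number of slots (assumed helper)."""
    rng = random.Random(seed)
    return sum(select_q_best(Q, p_e, rng)[0] for _ in range(trials)) / trials
```

Because the intervals are examined from left to right, the selected values come out in increasing order of $y$, i.e., best relay first. Estimates from `mean_slots` for $Q = 1$ and $Q = 2$ can be compared with the figures reported in Sec. II-D and Sec. III-E.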

B. Brief Explanation 

The logic behind the algorithm is as follows: (i) When a collision occurs, the threshold interval
for the next slot is the left (L) half of that of the present slot. (ii) When a collision occurs, the
threshold interval must have at least 2 nodes. Thus, when a success follows a collision, the
threshold interval for the next slot is the right (higher) half (R) of the previous slot, since it
is known to have at least one node. (iii) When an idle follows a collision, it implies that all
the nodes involved in the collision lie in the right half of the previous split interval. Thus, the
right half is further split into two equal halves, and the threshold interval for the next slot is the
left half of this split. (iv) When there is no collision to be resolved, the algorithm moves to the adjacent
threshold interval (which we call the collision resolution interval) of size $p_e$.^4 As mentioned above,
the algorithm terminates after $Q$ successes.

Comments: The proposed algorithm is equivalent to the algorithm of Sec. II when $Q = 1$.
It is similar to FCFS, except that it stops after the $Q$-th success. There is one subtle difference,
however, between the algorithm and FCFS. In FCFS, the contention resolution interval can be
smaller than $p_e$ if the difference between the current time and the time of the last resolved
interval is small. However, this does not happen in our algorithm (step 4) because all the nodes
know their individual metrics a priori. Notice that the algorithm is greedy in that it does not
account for possible interactions between metrics of the relays. However, such a greedy approach
has often been used given its inherent distributability [13].^5



C. Algorithm Analysis: Best Two Nodes Selection 

First, we analyze the algorithm for selecting the best two nodes using the Poisson point
approach that came out of Sec. II. This will lead to an analysis for the general $Q > 2$ node
selection case. The $Q = 2$ analysis is shown separately as it turns out to be richer.

Let $m_\infty^{(Q)}(p_e)$ represent the average number of slots required to select the best $Q$ nodes.
Thus, the symbol $m_\infty(p_e)$, which was used in the previous section on single relay selection,
is equivalent to $m_\infty^{(1)}(p_e)$. The following theorem gives two different but equivalent and exact
expressions for $m_\infty^{(2)}(p_e)$.



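Theorem 2: Let $E[X_k^{(Q)}]$ denote the average number of slots required to select the best $Q$
nodes after $k$ nodes collide. As $n \to \infty$, $m_\infty^{(2)}(p_e)$ is given by

$$m_\infty^{(2)}(p_e) = \frac{1}{e^{p_e} - 1} \sum_{k=1}^{\infty} E[X_k^{(2)}] \frac{p_e^{k}}{k!} + \frac{1}{1 - e^{-p_e}}, \qquad (6)$$

where

$$E[X_k^{(2)}] = (2^{k} - 2)^{-1} \left( \sum_{i=2}^{k-1} \binom{k}{i} E[X_i^{(2)}] + k \left(1 + E[X_{k-1}^{(1)}]\right) + 2^{k} \right), \quad \forall\, k \ge 3, \qquad (7)$$

$E[X_2^{(2)}] = 3$, and $E[X_1^{(2)}] = m_\infty^{(1)}(p_e)$.

Alternately, $m_\infty^{(2)}(p_e)$ also equals

$$m_\infty^{(2)}(p_e) = \frac{1}{1 - e^{-p_e}} + P_0\, m_\infty^{(1)}(p_e) + \sum_{i=1}^{\infty} \left( p(i) + p'(i) + p''(i+1) \right), \qquad (8)$$

where $p(i) = (1 - P_0) \prod_{j=1}^{i-1} (1 - P_{L,j})$, $\forall\, i \ge 1$, $p'(i) = p(i) P_{L,i}$, $\forall\, i \ge 1$, $p''(2) = p'(1)(1 - P_{R,1})$,
and $p''(i) = p'(i-1)(1 - P_{R,i-1}) + p''(i-1)(1 - P_{L,i-1})$, $\forall\, i > 2$. Here, $P_0 = \frac{p_e e^{-p_e}}{1 - e^{-p_e}}$,
$P_{L,i} = \frac{2^{-i} p_e\, e^{-2^{-i} p_e} \left(1 - e^{-2^{-i} p_e}\right)}{1 - \left(1 + 2^{-(i-1)} p_e\right) e^{-2^{-(i-1)} p_e}}$, and $P_{R,i} = \frac{2^{-i} p_e\, e^{-2^{-i} p_e}}{1 - e^{-2^{-i} p_e}}$.

Proof: The proof is given in Appendix D. It also gives a physical meaning for $p(i)$, $p'(i)$,
$p''(i)$, $P_{L,i}$, and $P_{R,i}$. ■

^4 We can relax the restriction that each collision resolution interval is of length $p_e$. However, it can be shown that doing this
leads to a negligible improvement.

^5 A more general version of the algorithm would allow for the metrics to be modified on the basis of the relays that have
already been selected. Developing such an algorithm is an interesting avenue for future work, and would find several applications,
such as in the time-sharing proportional fair solution of [25].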


D. Algorithm Analysis: Best Q > 2 Nodes Selection 

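We now derive a general expression for $m_\infty^{(Q)}(p_e)$ for any $Q \ge 2$. This generalizes the first
result of Theorem 2.

Theorem 3: As $n \to \infty$, the average number of slots required to select the best $Q \ge 2$ nodes
is

$$m_\infty^{(Q)}(p_e) = \frac{1}{e^{p_e} - 1} \sum_{k=1}^{\infty} E[X_k^{(Q)}] \frac{p_e^{k}}{k!} + \frac{1}{1 - e^{-p_e}}, \qquad (9)$$

where

$$E[X_k^{(Q)}] = (2^{k} - 2)^{-1} \left( \sum_{j=2}^{k-1} \binom{k}{j} E[X_j^{(Q)}] + k \left(1 + E[X_{k-1}^{(Q-1)}]\right) + 2^{k} \right), \quad \forall\, k \ge 3, \qquad (10)$$

$E[X_2^{(Q)}] = m_\infty^{(Q-2)}(p_e) + 3$, $\forall\, Q > 2$, $E[X_2^{(2)}] = 3$, and $E[X_1^{(Q)}] = m_\infty^{(Q-1)}(p_e)$.

Proof: The proof is given in Appendix E. ■

A non-recursive expression for $m_\infty^{(Q)}(p_e)$ for $Q > 2$ along the lines of (3) of Theorem 1 and (8)
of Theorem 2 can be derived. However, the Markov chains become more involved.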
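The recursions of Theorems 2 and 3 are straightforward to evaluate numerically. The Python sketch below is a hedged illustration, not the authors' code. It assumes the forms $m_\infty^{(Q)}(p_e) = \frac{1}{e^{p_e}-1} \sum_k E[X_k^{(Q)}] \frac{p_e^k}{k!} + \frac{1}{1 - e^{-p_e}}$ and $E[X_k^{(Q)}] = (2^k - 2)^{-1} \left( \sum_{j=2}^{k-1} \binom{k}{j} E[X_j^{(Q)}] + k(1 + E[X_{k-1}^{(Q-1)}]) + 2^k \right)$, uses the Lemma 1 recursion as the $Q = 1$ base case, truncates the series at an arbitrary `k_max`, and locates the best contention load by a simple grid search. The names `m_inf` and `best_contention_load` are hypothetical.

```python
from math import comb, exp, factorial


def m_inf(Q, p_e, k_max=40):
    """Asymptotic average number of slots to select the best Q nodes."""
    idle = 1.0 / (1.0 - exp(-p_e))
    weight = [p_e ** k / factorial(k) / (exp(p_e) - 1.0)
              for k in range(k_max + 1)]
    m = [0.0]                  # m[0]: nothing left to select costs 0 slots
    prev = None                # prev[k] = E[X_k^(q-1)]
    for q in range(1, Q + 1):
        ex = [0.0] * (k_max + 1)
        ex[1] = m[q - 1]                       # success, then q - 1 more nodes
        ex[2] = 2.0 if q == 1 else 3.0 + m[q - 2]
        for k in range(3, k_max + 1):
            inner = sum(comb(k, j) * ex[j] for j in range(2, k))
            # after a lone success in the examined half: done if q == 1,
            # otherwise one collision slot plus E[X_{k-1}^(q-1)]
            after_single = 0.0 if q == 1 else 1.0 + prev[k - 1]
            ex[k] = (inner + k * after_single + 2 ** k) / (2 ** k - 2)
        m.append(idle + sum(weight[k] * ex[k] for k in range(1, k_max + 1)))
        prev = ex
    return m[Q]


def best_contention_load(Q, lo=0.9, hi=1.5, step=0.001):
    """Grid search for the p_e minimizing m_inf(Q, p_e); returns (p_e, value)."""
    count = int(round((hi - lo) / step))
    grid = [round(lo + i * step, 6) for i in range(count + 1)]
    return min(((p, m_inf(Q, p)) for p in grid), key=lambda pair: pair[1])
```

Evaluating `m_inf` at the contention loads quoted in Sec. III-E gives a direct numerical check of the values reported there, and `best_contention_load` can be used to explore how the minimizing $p_e$ moves with $Q$.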

E. Results for Q Best Relay Selection 

Figure 4 plots $m_\infty^{(2)}(p_e)$ as a function of $p_e$ using Theorem 2 and verifies it using Monte
Carlo simulations. It can be seen that the asymptotic expressions are accurate even for a small
number of nodes, e.g., $n = 20$. The lowest average number of slots required to select two users
is 4.406, which occurs at $p_e = 1.221$. This is 10.7% faster than running the single relay selection
algorithm twice, which requires 2 x 2.467 = 4.934 slots. The increase in the optimal $p_e$ from
1.088 for $Q = 1$ to 1.221 for $Q = 2$ occurs because now it is faster to resolve a
collision than to avoid it. Specifically, the time taken to select two nodes given that they are
involved in a collision is $E[X_2^{(2)}] = 3.0$ slots, whereas the number of slots required to select
two nodes, given that the previous slot was idle, is 4.4 slots.

Table I provides the optimum values of $p_e$ and the average number of slots as a function of
the number of relays that need to be selected. We can see that selecting the best three nodes
takes 6.491 slots, on average, and is achieved when $p_e = 1.214$.^6 As $Q \to \infty$, the optimum
value of $p_e$ increases to 1.266, which is also the optimum value maximizing the throughput of
FCFS [16].^7 Also, it can be shown that $Q / m_\infty^{(Q)}(1.266)$, which represents the average number of users
selected per slot by the algorithm for $p_e = 1.266$, increases to 0.487 as $Q \to \infty$.

F. Tackling Discrete Metrics Using Proportional Expansion 

The thresholding algorithms in Sec. II and Sec. III-A exploit the critical fact that with
probability one no two metrics are equal. However, as mentioned in the Introduction, when the
metric has a discrete probability distribution, the algorithms break down because the probability
that the metrics of the best two nodes are exactly equal is non-zero. We now provide a simple
and novel distributed solution called Proportional Expansion to tackle this practical problem.

Proportional Expansion: Let the metric $u_i$ be a realization of an $\omega$-valued discrete random
variable that, without loss of generality, takes values $1, 2, \ldots, \omega$ with probability $p_1, p_2, \ldots, p_\omega$,
respectively. Each node independently maps its metric $u_i$ into a new metric $\nu_i$ as follows: When
$u_i = j$, $\nu_i$ is a realization of a uniformly distributed random variable in $\left(\sum_{t=0}^{j-1} p_t, \sum_{t=0}^{j} p_t\right)$,
where $p_0 = 0$. In other words, each node chooses a new random metric $\nu_i$ that is uniformly
distributed over a bin of length proportional to the probability mass of its original metric $u_i$.

The new metric of every user is then uniformly distributed in
$(0, 1)$. Proportional Expansion satisfies two key properties:

• It preserves the sorting order of the metrics: if $u_i > u_j$, then $\nu_i > \nu_j$. Hence, selecting the
$Q$ nodes with the highest $\nu_i$s is equivalent to selecting $Q$ nodes with the highest $u_i$s.

• The probability that $\nu_i = \nu_j$, for $i \ne j$, is 0 since $\nu_i$ is a continuous random variable.

Therefore, the selection algorithm of Sec. III for any $Q \ge 1$ can then be run on $\nu_i$. The
following Proposition formally quantifies the performance of Proportional Expansion. It implies
that proportional expansion is scalable, i.e., it takes at most 2.47 slots for best relay selection,
4.406 slots for selecting the best 2 relays, and so on, for any number of relays, $n$.

^6 The marginal decrease in the optimal value of $p_e$ from 1.221 to 1.214 when $Q$ increases from 2 to 3 can be explained
as follows. The time taken to select three nodes after a collision among two nodes is $E[X_2^{(3)}] \approx 5.48$ slots. However, the
number of slots required to select three nodes after an idle slot is 6.49 slots, which is just 17.8% more than 5.48. Therefore,
the optimum $p_e$ decreases since the selection times after an idle and a collision are not as unequal as for $Q = 2$.

^7 The maximum arrival rate of 0.487 is supported when the initial collision interval is capped at 2.6. This implies that there are
on average 0.487 x 2.6 = 1.266 nodes transmitting. The contention parameter $p_e$ is set using the normalized metric CCDF with an
'arrival rate' equal to 1.

Proposition 1: The average number of slots required to select the best Q relays by Proportional
Expansion for the discrete metrics case is the same as that of the best Q relay threshold based
selection algorithm of Sec. III-A that operates on continuous metrics.

Proof: The proof is omitted since it directly follows from the above discussion. ■ 
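The Proportional Expansion mapping can be illustrated with a short Python sketch. It is a minimal example under stated assumptions: metric values are the integers 1 to $\omega$, `pmf[j - 1]` holds $p_j$, and the function name `proportional_expansion` is hypothetical. The draw uses a half-open bin, which differs from the open interval in the text only on a set of probability zero.

```python
import random
from itertools import accumulate


def proportional_expansion(u, pmf, rng):
    """Map a discrete metric u in {1, ..., len(pmf)} to a continuous one.

    The new metric is uniform over the bin [sum_{t<u} p_t, sum_{t<=u} p_t),
    whose length equals the probability mass of the value u.
    """
    cdf = [0.0] + list(accumulate(pmf))   # cdf[j] = p_1 + ... + p_j
    low, high = cdf[u - 1], cdf[u]
    return low + (high - low) * rng.random()


if __name__ == "__main__":
    rng = random.Random(0)
    pmf = [0.2, 0.5, 0.3]                 # example distribution, omega = 3
    metrics = rng.choices([1, 2, 3], weights=pmf, k=8)
    expanded = [proportional_expansion(u, pmf, rng) for u in metrics]
    for u, nu in zip(metrics, expanded):
        print(u, round(nu, 4))
```

Since a larger discrete value always lands in a higher bin, ranking nodes by the expanded metric never contradicts the ranking by the original metric, while ties among equal discrete values are broken at random. The expanded metric is uniform on (0, 1), so its CCDF is simply $1 - \nu$, and the splitting algorithms can be run on it unchanged.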

IV. Conclusions 

We developed a new asymptotic analysis for the single relay splitting based selection algorithm, 
which was based on a new Poisson point process interpretation of the dynamics of the algorithm. 
This led to a characterization of the optimal parameters of the algorithm, and enabled a rigorous 
benchmarking of the greedy parameter setting used in the literature. We also proposed a new 
splitting based algorithm for selecting the best Q relays, which are useful for several cooperative 
protocols proposed in the literature. The new algorithm was more efficient than running the single 
relay selection algorithm multiple times. Furthermore, we generalized the analytical techniques 
to handle multiple relay selection, and derived the exact expressions for the average number of 
slots for multiple relay selection. Interestingly, the asymptotic expressions were accurate even 
for a small number of relays. With the help of proportional expansion, we showed, for the first 
time, that splitting algorithms can be adapted to work for discrete metrics as well without any 
loss in performance or scalability whatsoever. 

The analysis shows that the greedy policy of maximizing the success probability in the next 
slot is suboptimal. While it works well for single relay selection, it becomes more and more 
suboptimal as the number of relays to be selected increases. The analysis also shows that the 
general single relay selection algorithm, the proposed multiple relay selection algorithm, and 
the FCFS multiple access control algorithm are intimately related. For example, the optimal 
value of the contention load parameter increases as the number of relays to be selected increases
and finally approaches the optimal setting for FCFS. This is despite the fact that selection
and multiple access control algorithms serve very different purposes and are, therefore, evaluated
differently. While multiple access control algorithms attempt to serve all nodes and are evaluated,
for example, by the maximum traffic they can handle with a finite delay, selection algorithms 
are evaluated by how fast they can select the best nodes. We hope that this insight will help 
develop better selection algorithms. An important property of splitting algorithms is that,
besides being distributed, they are both extremely fast and scalable. This suggests that selection
based protocols will deliver improvements in the overall end-to-end system-level performance 
even when the time overhead incurred by the selection algorithm is accounted for. The system- 
level benefits can be further improved if the multiple relay selection algorithm proposed in this 
paper can be modified to allow the metrics to be updated during the selection process. 

Appendix 

A. Proof of Lemma 1

It can be easily seen that the idle phase consists of at most $q = \lceil n/p_e \rceil - 1$ slots, since at
this stage the lower threshold equals the smallest value 0. Given that the first non-idle slot is
the $i^{\text{th}}$ slot and $k$ nodes are involved, the average number of slots required to find the best node
is $E[X_k] + i$. (The recursive expression for $E[X_k]$ is given in [17, (6)].) The probability that
the first non-idle slot is the $i^{\text{th}}$ slot and $k$ nodes transmit in it equals
$\binom{n}{k}\left(\frac{p_e}{n}\right)^{k}\left(1 - \frac{i p_e}{n}\right)^{n-k}$, for
$i \leq q$. This constitutes the first term of the right side of the expression in Lemma 1. The probability that the $(q + 1)^{\text{th}}$ slot
is the first non-idle slot is $\left(1 - \frac{q p_e}{n}\right)^{n}$, since no node's metric may lie in the intervals covered by the first $q$ slots.
In the event that this happens, all $n$ nodes will transmit and collide, which will take $E[X_n]$ slots
to resolve. Hence, the second term on the right side of the expression in Lemma 1 follows.

B. Proof of Non-Recursive Expression of Theorem 1

Let the random variable $I$ denote the number of slots required until (and including) the first
non-idle slot and $Y$ denote the number of slots required after that.

Consider the state transition diagram of Figure 6, in which the state represents the number
of slots that have elapsed since the first non-idle slot. The node goes to state $S$ whenever
a success occurs, and the algorithm terminates. Otherwise, in case of an idle or collision, the
node increments its state by 1. By definition, state 0 is the first non-idle slot itself; thus, an idle
outcome cannot occur in it. The following lemma is crucial in analyzing this transition diagram.



Lemma 2: The state transition diagram of Fig. 6 is a Markov chain.

Proof: To prove this, it is sufficient to prove that the transition probability from any state
$i$ to $S$ depends only on $i$. (Having done so, we shall denote this probability by $P_i$.)

We refer to the interval in $M(t)$ allocated to state $i$ as its threshold interval. Here, $P_0$ is the
probability that in a threshold interval of size $p_e$ only one node transmits given that at least one
node transmits in that slot. Let $N(x) = M(t + x) - M(t)$. Then, from the memoryless property
of the Poisson process, $\Pr(N(x) = i)$ is independent of $t$ and is equal to $e^{-x}x^{i}/i!$. Thus,

$$P_0 = \Pr\left(N(p_e) = 1 \mid N(p_e) \geq 1\right) = \frac{p_e e^{-p_e}}{1 - e^{-p_e}}.$$

$P_1$ is the probability that the second non-idle slot is a success given that the first non-idle slot (of
threshold interval size $p_e$) is a collision. Due to splitting, the second slot will have a threshold
interval size that is half that of the first one. Therefore, $P_1$ is the probability that conditioned on
$N(p_e)$ having at least 2 nodes (i.e., a collision), $N(p_e/2)$ has exactly one. Thus,

$$P_1 = \frac{\Pr\left(N\left(\frac{p_e}{2}\right) = 1,\; N(p_e) \geq 2\right)}{\Pr\left(N(p_e) \geq 2\right)}.$$

Therefore,

$$P_1 = \frac{\Pr\left(N\left(\frac{p_e}{2}\right) = 1,\; N(p_e) - N\left(\frac{p_e}{2}\right) \geq 1\right)}{\Pr\left(N(p_e) \geq 2\right)} = \frac{\frac{p_e}{2}e^{-p_e/2}\left(1 - e^{-p_e/2}\right)}{1 - (1 + p_e)e^{-p_e}}.$$

For $P_2$, the following two trajectories can occur: state 2 was reached by a collision in state 1
or by an idle in state 1. In case of a collision, the threshold interval of the second non-idle slot
(of size $p_e/2$) gets split into two halves. Even in the case of an idle, the interval would be split
into two halves and nodes from the left half would contend. Thus, $P_2$ is equal to the probability
that conditioned on an interval of size $p_e/2$ having at least two nodes, half the interval (of size
$p_e/4$) has exactly one node. Thus,

$$P_2 = \Pr\left(N\left(\frac{p_e}{4}\right) = 1 \,\Big|\, N\left(\frac{p_e}{2}\right) \geq 2\right) = \frac{\frac{p_e}{4}e^{-p_e/4}\left(1 - e^{-p_e/4}\right)}{1 - \left(1 + \frac{p_e}{2}\right)e^{-p_e/2}}.$$

In a similar way, we can show that $P_i$, $\forall\, i \geq 1$, is equal to the probability that conditioned on
an interval of size $2^{-(i-1)}p_e$ having at least two nodes, one half of the interval (of size $2^{-i}p_e$)
has exactly one node. Thus,

$$P_i = \Pr\left(N\left(\frac{p_e}{2^{i}}\right) = 1 \,\Big|\, N\left(\frac{p_e}{2^{i-1}}\right) \geq 2\right) = \frac{2^{-i}p_e\, e^{-2^{-i}p_e}\left(1 - e^{-2^{-i}p_e}\right)}{1 - \left(1 + 2^{-(i-1)}p_e\right)e^{-2^{-(i-1)}p_e}}. \quad ■$$

Now, $m_\infty(p_e) = E[I] + E[Y]$. From the Poisson process interpretation of Theorem 1 we can
show that $\Pr(I = i) = e^{-(i-1)p_e}(1 - e^{-p_e})$. Therefore, $E[I] = \sum_{i=1}^{\infty} i\, e^{-(i-1)p_e}(1 - e^{-p_e}) = \frac{1}{1 - e^{-p_e}}$.
The average number of slots required after the first non-idle slot to select the best node, $E[Y]$,
is calculated as follows. First, $E[Y] = \sum_{i=1}^{\infty} i \Pr(Y = i)$ can be shown to be identically equal
to $\sum_{i=1}^{\infty} \Pr(Y \geq i)$. Second, since each state in the Markov chain is visited at most once, it
follows that $E[Y] = \sum_{i=1}^{\infty} p(i)$, where $p(i)$ is the probability that the $i^{\text{th}}$ state is visited. From
the state transition diagram, it is easy to see that $p(i) = (1 - P_0)\prod_{j=1}^{i-1}(1 - P_j)$. Hence, the
desired expression for $m_\infty(p_e)$ follows.
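The non-recursive expression obtained from this Markov chain argument is easy to evaluate numerically. The sketch below is a hedged illustration rather than the authors' code: the function names are hypothetical, and the infinite sum over states is truncated at `n_states` terms, which is an assumption made here because the visit probabilities roughly halve with every state (it also avoids the numerical cancellation that occurs for extremely small intervals).

```python
import math

def success_prob(i, p_e):
    """P_i: probability that state i of the chain of Fig. 6 ends in a success."""
    if i == 0:
        # First non-idle slot: exactly one arrival given at least one.
        return p_e * math.exp(-p_e) / (1.0 - math.exp(-p_e))
    x = p_e / 2 ** i  # size of the half-interval that contends in state i
    numerator = x * math.exp(-x) * (1.0 - math.exp(-x))
    denominator = 1.0 - (1.0 + 2.0 * x) * math.exp(-2.0 * x)
    return numerator / denominator

def average_slots(p_e, n_states=20):
    """Truncated evaluation of E[I] + E[Y] for single relay selection."""
    expected_i = 1.0 / (1.0 - math.exp(-p_e))
    visit = 1.0 - success_prob(0, p_e)        # p(1)
    expected_y = 0.0
    for i in range(1, n_states + 1):
        expected_y += visit                    # adds p(i)
        visit *= 1.0 - success_prob(i, p_e)    # becomes p(i + 1)
    return expected_i + expected_y

print(average_slots(1.088))
```

Evaluating `average_slots` on a grid of $p_e$ values is one simple way to reproduce the shape of the curve in Fig. 2 and to compare the result at $p_e = 1.088$ with the first row of Table I.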

C. Proof of Corollary 1

From [17], we have $E[X_k] \leq \log_2(k) + 1$, $k \geq 2$, and $E[X_1] = 0$. Since $\log_2(x)$ is concave
with respect to $x$, a tangent to it at any point $(k_0, \log_2(k_0))$ is an upper bound. Therefore,

$$\log_2(k) \leq \frac{k - k_0}{k_0 \log_e(2)} + \log_2(k_0). \qquad (15)$$

Consequently, $E[X_k] \leq \frac{k}{k_0 \log_e(2)} + \log_2(2k_0/e)$, $k \geq 2$. Substituting this in we get

For ko > e/2, log2(2A;o/e) > 0. Also, § = e^-^ - 1 - < e^- - 1 since Pe > 0. 

Therefore, for ko > e/2, the first term in the right hand side of (fT6l) is less than log2 (^)- 
Substituting this inequality in (fT6l) and simplifying leads to the desired result in ©. 
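The tangent-line bound (15) and the algebra that turns it into the bound on $E[X_k]$ can be checked numerically. The following is a small sanity-check sketch; the function name and the choice $k_0 = 3$ are illustrative assumptions, not values taken from the paper.

```python
import math

def tangent_bound(k, k0):
    """Right side of (15): tangent to log2 at k0, evaluated at k."""
    return (k - k0) / (k0 * math.log(2)) + math.log2(k0)

k0 = 3.0  # illustrative tangent point; equality in (15) holds at k = k0
gaps = [tangent_bound(k, k0) - math.log2(k) for k in range(1, 50)]

def bound_on_xk(k, k0):
    """k / (k0 ln 2) + log2(2 k0 / e), i.e. 1 + tangent_bound(k, k0)."""
    return k / (k0 * math.log(2)) + math.log2(2.0 * k0 / math.e)
```

All entries of `gaps` are non-negative because $\log_2$ is concave, and `bound_on_xk(k, k0)` coincides with `1 + tangent_bound(k, k0)` since $1/\log_e(2) = \log_2(e)$.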

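D. Proof of Theorem 2

Proof of the recursive expression: Given that the first non-idle slot is the $i^{\text{th}}$ slot and $k \geq 1$ nodes are involved, the
average number of slots required to select the best 2 nodes is $E[X_k^{(2)}] + i$. The probability that
the first non-idle slot is the $i^{\text{th}}$ slot and $k \geq 1$ nodes are involved is $e^{-i p_e} p_e^{k}/k!$. Hence, we get

$$m_\infty^{(2)}(p_e) = \sum_{i=1}^{\infty}\sum_{k=1}^{\infty} \frac{e^{-i p_e} p_e^{k}}{k!}\left(E[X_k^{(2)}] + i\right), \qquad (17)$$

simplifying which yields the desired expression.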

If only one node transmits in the first non-idle slot, then a success occurs and the node gets
selected. Selecting one more node will take $m_\infty^{(1)}(p_e)$ slots, on average. (This follows from the
memoryless property of the Poisson process [22].) Thus, $E[X_1^{(2)}] = m_\infty^{(1)}(p_e)$. Also, if exactly
two nodes transmit in the first non-idle slot, only one node transmits in the slot just after the
first success. Thus, $E[X_2^{(2)}] = E[X_2^{(1)}] + 1 = 3$ slots. When $k \geq 3$ nodes transmit in the
first non-idle slot, the following three cases are possible for the next slot: (i) Collision among $i$
nodes: $E[X_i^{(2)}]$ more slots would then be required, on average. (ii) Idle: $E[X_k^{(2)}]$ more slots
are required, on average. (iii) Success: The next slot would then surely involve a collision among
$k - 1$ nodes. $E[X_{k-1}^{(1)}]$ slots, on average, would be required after that. The probability that $i$
nodes transmit in the next slot is $\binom{k}{i}/2^{k}$. Thus,

$$E[X_k^{(2)}] = \frac{1}{2^{k}}\left(1 + E[X_k^{(2)}]\right) + \frac{k}{2^{k}}\left(2 + E[X_{k-1}^{(1)}]\right) + \sum_{i=2}^{k}\binom{k}{i}\frac{1}{2^{k}}\left(1 + E[X_i^{(2)}]\right). \qquad (18)$$

Simplifying this further using combinatorial identities [23] results in the desired expression.

Proof of the non-recursive expression: This proof also involves constructing a state transition diagram that will be proved
to be a Markov chain. Consider the state transition diagram of Figure 7. It is more involved
than that in Figure 6 because we need to also track how many successes have occurred. State $i$
corresponds to the $i^{\text{th}}$ split before the first success (which takes $i$ slots), state $i'$ corresponds to the
first success occurring at the $i^{\text{th}}$ slot, and state $i''$ corresponds to the first success having already
occurred by the $i^{\text{th}}$ slot. The state transition diagram can be explained in detail as follows.

State 0 corresponds to the first non-idle slot. If the first non-idle slot is a success, the node
moves from state 0 to state $S_1$. Now, the algorithm starts a new collision resolution to find
the second colliding node. This takes time $m_\infty^{(1)}(p_e)$, which is given by Theorem 1. If the first
non-idle slot is a collision, its threshold interval is split and the node transitions from state 0
to state 1. Each subsequent idle or collision results in one additional split and the node moves
from state $i$ to $i + 1$. In case of a success, the node moves from state $i$ to state $i'$ as no
additional split occurs. A success in state $i'$ results in a transition to state $S$, at which time the
algorithm terminates. In case of a collision in state $i'$, the node moves to state $(i + 1)''$, as one
more split occurs. In case of a success in state $i''$, the node moves to state $S$, and the algorithm
terminates. Otherwise, an idle or collision results in a transition from state $i''$ to $(i + 1)''$. Note
that in each state ($i$, $i'$, or $i''$) the size of the threshold interval is $2^{-i}p_e$.

The following Lemma shall prove to be crucial in analyzing this transition diagram.

Lemma 3: The state transition diagram of Fig. 7 is a Markov chain.

Proof: For this, it is sufficient to prove that the transition probabilities for each state depend
only on $i$ and not on the path taken to reach that state.

Let $P_0$ be the probability of success in state 0 (the first non-idle slot). It is equal to the
probability that in a slot of size $p_e$ only one node transmits given that at least one node transmits
in that slot. Let $N(x) = M(t + x) - M(t)$. Then, by the memoryless property of the Poisson process,
$\Pr(N(x) = i)$ is independent of $t$ and is equal to $e^{-x}x^{i}/i!$. Thus,

$$P_0 = \Pr\left(N(p_e) = 1 \mid N(p_e) \geq 1\right) = \frac{p_e e^{-p_e}}{1 - e^{-p_e}}. \qquad (19)$$

Let $P_{L,i}$ be the probability of success in state $i$, which is equal to the probability that given an
interval of size $2^{-(i-1)}p_e$ having more than one node, the left half of it has exactly one node. Thus,

$$P_{L,i} = \Pr\left(N\left(\frac{p_e}{2^{i}}\right) = 1 \,\Big|\, N\left(\frac{p_e}{2^{i-1}}\right) > 1\right) = \frac{2^{-i}p_e\, e^{-2^{-i}p_e}\left(1 - e^{-2^{-i}p_e}\right)}{1 - \left(1 + 2^{-(i-1)}p_e\right)e^{-2^{-(i-1)}p_e}}. \qquad (20)$$

Let $P_{R,i}$ be the probability of success in state $i'$. State $i'$ can be entered only after a success in
state $i$. Thus, the threshold interval of state $i'$, which is the right half of the split during state $i - 1$, has
at least one node. Thus, $P_{R,i}$ is equal to the probability that exactly one node transmits in the
slot with interval size $2^{-i}p_e$, given that at least one node lies in that interval, which equals

$$P_{R,i} = \Pr\left(N\left(\frac{p_e}{2^{i}}\right) = 1 \,\Big|\, N\left(\frac{p_e}{2^{i}}\right) \geq 1\right) = \frac{2^{-i}p_e\, e^{-2^{-i}p_e}}{1 - e^{-2^{-i}p_e}}. \qquad (21)$$

The probability of success in state $i''$ is again equal to the probability that given that an interval
of size $2^{-(i-1)}p_e$ has more than one node, its left half has exactly one node. This probability equals
$P_{L,i}$. Thus, from (20) and (21), the transition probabilities $P_{L,i}$ and $P_{R,i}$ only depend on $i$, which
proves that Fig. 7 is a Markov chain. ■

Let the random variable $I$ denote the number of slots required until (and including) the first
non-idle slot and $Y$ denote the number of slots required after that. Then, $m_\infty^{(2)}(p_e) = E[I] + E[Y]$.
Again, using the Poisson point process interpretation, $\Pr(I = i) = e^{-(i-1)p_e}(1 - e^{-p_e})$, which implies

$$E[I] = \sum_{i=1}^{\infty} i\, e^{-(i-1)p_e}\left(1 - e^{-p_e}\right) = \frac{1}{1 - e^{-p_e}}. \qquad (22)$$

$E[Y]$ can be calculated from Lemma 3 as follows. Let $p(i)$, $p'(i)$, and $p''(i)$ be the probability
that states $i$, $i'$, and $i''$ are visited, respectively. From the state transition diagram, since state $i$
can be reached only from state $i - 1$ and state $i'$ can be reached only from state $i$, we get
$p(i) = (1 - P_0)\prod_{j=1}^{i-1}(1 - P_{L,j})$, $\forall\, i \geq 1$, and $p'(i) = p(i)P_{L,i}$, $\forall\, i \geq 1$. Also, since for $i > 2$
state $i''$ can be reached from $(i - 1)'$ and $(i - 1)''$, we get

$$p''(i) = p'(i - 1)\left(1 - P_{R,i-1}\right) + p''(i - 1)\left(1 - P_{L,i-1}\right), \quad \forall\, i > 2, \qquad (23)$$

and

$$p''(2) = p'(1)\left(1 - P_{R,1}\right). \qquad (24)$$

Now, if state $S_1$ is visited, $m_\infty^{(1)}(p_e)$ slots, on average, are required, which occurs with probability
$P_0$. Else, the average number of slots is equal to $\sum_{j=1}^{\infty} j \Pr(Z = j)$, where $Z$ is the total
number of states visited excluding state 0. This is so because we are counting the number of
slots required after the first non-idle slot. Since each state is visited at most once, the average
above is equal to $\sum_{i=1}^{\infty}\left(p(i) + p'(i) + p''(i)\right)$, with $p''(1) = 0$. Thus, the average number of slots required after
the first non-idle slot is $E[Y] = P_0\, m_\infty^{(1)}(p_e) + \sum_{i=1}^{\infty}\left(p(i) + p'(i) + p''(i)\right)$.
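The visit probabilities $p(i)$, $p'(i)$, and $p''(i)$ can be accumulated in a single loop. The sketch below is an illustration under stated assumptions, not the authors' implementation: function names are hypothetical, the infinite sums are truncated at `n_states` terms, and $m_\infty^{(1)}(p_e)$ is evaluated with the analogous single-relay chain of Appendix B.

```python
import math

def p_left(i, p_e):
    """P_{L,i} of (20); for i = 0 this returns P_0 of (19)."""
    if i == 0:
        return p_e * math.exp(-p_e) / (1.0 - math.exp(-p_e))
    x = p_e / 2 ** i
    return (x * math.exp(-x) * (1.0 - math.exp(-x))
            / (1.0 - (1.0 + 2.0 * x) * math.exp(-2.0 * x)))

def p_right(i, p_e):
    """P_{R,i} of (21)."""
    x = p_e / 2 ** i
    return x * math.exp(-x) / (1.0 - math.exp(-x))

def m1(p_e, n_states=20):
    """Single relay selection: E[I] + sum of p(i), truncated."""
    total = 1.0 / (1.0 - math.exp(-p_e))
    visit = 1.0 - p_left(0, p_e)
    for i in range(1, n_states + 1):
        total += visit
        visit *= 1.0 - p_left(i, p_e)
    return total

def m2(p_e, n_states=20):
    """Best-two selection: E[I] + P_0 m1 + sum of (p + p' + p''), truncated."""
    p0 = p_left(0, p_e)
    expected_i = 1.0 / (1.0 - math.exp(-p_e))            # (22)
    expected_y = p0 * m1(p_e, n_states)                   # path through S_1
    p = 1.0 - p0                                          # p(1)
    prev_p1, prev_p2 = 0.0, 0.0                           # p'(i-1), p''(i-1)
    for i in range(1, n_states + 1):
        p1 = p * p_left(i, p_e)                           # p'(i)
        if i == 1:
            p2 = 0.0                                      # state 1'' is never visited
        else:
            # (23); for i = 2 the second term is zero, which gives (24)
            p2 = (prev_p1 * (1.0 - p_right(i - 1, p_e))
                  + prev_p2 * (1.0 - p_left(i - 1, p_e)))
        expected_y += p + p1 + p2
        p *= 1.0 - p_left(i, p_e)                         # p(i + 1)
        prev_p1, prev_p2 = p1, p2
    return expected_i + expected_y

print(m2(1.221))
```

The value printed for $p_e = 1.221$ can be compared with the $Q = 2$ row of Table I; sweeping $p_e$ gives a curve of the kind shown in Fig. 4.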



E. Proof of Theorem 3

The proof is similar to the proof of the recursive expression in Theorem 2 except for the following differences:

1) When two nodes transmit in the first non-idle slot, $E[X_2^{(2)}] = 3$ slots, on average, are
required to select both of them. Selecting the remaining best $Q - 2$ nodes takes another
$m_\infty^{(Q-2)}(p_e)$ slots, on average. Thus, $E[X_2^{(Q)}] = E[X_2^{(2)}] + m_\infty^{(Q-2)}(p_e)$.

2) When $k \geq 3$ nodes transmit in the first non-idle slot, the average number of slots required
thereafter is

$$E[X_k^{(Q)}] = 0.5^{k}\left[\left(1 + E[X_k^{(Q)}]\right) + k\left(2 + E[X_{k-1}^{(Q-1)}]\right) + \sum_{i=2}^{k}\binom{k}{i}\left(1 + E[X_i^{(Q)}]\right)\right]. \qquad (25)$$
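The recursions of Appendices D and E can be combined into one short numerical routine. The sketch below is a hedged illustration: `make_model`, `k_max`, and the truncation of the Poisson sum over $k$ are assumptions introduced here, and the sum over $i$ in (17) has been carried out in closed form, which gives $m_\infty^{(Q)}(p_e) = \frac{1}{1 - e^{-p_e}} + \frac{1}{e^{p_e} - 1}\sum_{k \geq 1}\frac{p_e^{k}}{k!}E[X_k^{(Q)}]$. For $Q = 1$ a success ends the selection, so the success branch contributes no further slots.

```python
import math
from functools import lru_cache

def make_model(p_e, k_max=40):
    """Return m(Q) and x(k, Q) for a fixed contention load p_e (sketch)."""

    @lru_cache(maxsize=None)
    def m(q):
        # Average slots to select the best q nodes; nothing to do for q = 0.
        if q == 0:
            return 0.0
        idle_part = 1.0 / (1.0 - math.exp(-p_e))          # E[I]
        weight = 1.0 / (math.exp(p_e) - 1.0)
        tail = sum(p_e ** k / math.factorial(k) * x(k, q)
                   for k in range(1, k_max + 1))
        return idle_part + weight * tail

    @lru_cache(maxsize=None)
    def x(k, q):
        # E[X_k^(Q)]: slots needed after k nodes transmit in one slot.
        if k == 1:
            return m(q - 1)      # a success; q - 1 nodes remain to be found
        # After a success in the next slot, the other k - 1 nodes occupy one
        # more slot and then need E[X_{k-1}^(Q-1)] slots (nothing if Q = 1).
        after_success = 0.0 if q == 1 else 1.0 + x(k - 1, q - 1)
        partial = sum(math.comb(k, i) * x(i, q) for i in range(2, k))
        rhs = 1.0 + 0.5 ** k * (k * after_success + partial)
        # An idle and a collision among all k nodes both lead back to x(k, q),
        # so the recursion is solved for x(k, q) explicitly.
        return rhs / (1.0 - 2.0 * 0.5 ** k)

    return m, x

for q, p_e in [(1, 1.088), (2, 1.221), (3, 1.214)]:
    m, _ = make_model(p_e)
    print(q, p_e, m(q))
```

Applying the same recursion at $k = 2$ gives $E[X_2^{(Q)}] = 3 + m_\infty^{(Q-2)}(p_e)$, which matches item 1) above, so no separate case is needed for it. The printed values can be compared with the corresponding rows of Table I.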
Fig. 1. A relay selection system consisting of a sink and $n$ relays/nodes, with node $i$ possessing a suitability metric $\mu_i$.

Fig. 2. Average number of slots required to select the best node ($m_\infty(p_e)$) as a function of $p_e$.

Fig. 3. Upper and lower bounds for the average number of slots required to select the best node.

Fig. 4. Average number of slots required to select the best two nodes ($m_\infty^{(2)}(p_e)$) as a function of $p_e$.
TABLE I

Optimum $p_e$ and the average number of slots required to select the best Q relays

Q    Optimum $p_e$    Optimum $m_\infty^{(Q)}(p_e)$ (slots)    Improvement
1    1.088            2.467                                     -
2    1.221            4.406                                     10.7%
3    1.214            6.491                                     12.3%
4    1.231            8.537                                     13.5%
5    1.236            10.592                                    14.1%
6    1.241            12.645                                    14.6%



Fig. 5. Illustration of Proportional Expansion for discrete metrics. The example shown is for the case where the metric takes
3 values 1, 2, and 3 with probabilities $p_1 = 0.2$, $p_2 = 0.5$, and $p_3 = 0.3$, respectively.




Fig. 6. State transition diagram for the number of slots required to select the best node after the first non-idle slot. 